The design and use of reference data sets for testing scientific software
Authors
Abstract
A general methodology for evaluating the accuracy of the results produced by scientific software has been developed at the National Physical Laboratory. The basis of the approach is the design and use of reference data sets and corresponding reference results to undertake black-box testing. The approach enables reference data sets and results to be generated in a manner consistent with the functional specification of the problem addressed by the software. The results returned by the software for the reference data are compared objectively with the reference results. Quality metrics are used for this purpose that account for the key aspects of the problem. In this paper it is shown how reference data sets can be designed for testing software implementations of solutions to a broad class of problems arising throughout science. It is shown how these data sets can be used in practice and how the results provided by software under test can properly be compared with reference results. The approach is illustrated with three examples: (i) mean and standard deviation, (ii) straight-line fitting, and (iii) principal-components analysis. Software for such problems is used routinely in many fields, including optical spectrometry.
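As an illustration of example (i), the sketch below builds a small reference data set whose mean and standard deviation are known by construction, runs two candidate implementations as black boxes, and scores each against the reference result. It is a minimal sketch under stated assumptions, not the procedure from the paper: the generator `reference_data`, the two implementations under test, and the "digits of agreement" score are all hypothetical names and choices introduced here for illustration.

```python
import math


def reference_data(offset, n=11):
    """Hypothetical generator: the integers offset-h .. offset+h (n = 2h+1).

    The mean is `offset` and the sample variance is (h+1)(2h+1)/6,
    so the reference results are known without running any software under test.
    """
    if n < 3 or n % 2 == 0:
        raise ValueError("n must be odd and at least 3")
    h = (n - 1) // 2
    data = [float(offset + k) for k in range(-h, h + 1)]
    ref_mean = float(offset)
    ref_std = math.sqrt((h + 1) * (2 * h + 1) / 6.0)
    return data, ref_mean, ref_std


def std_two_pass(x):
    """Software under test A: subtract the mean, then sum squared deviations."""
    m = sum(x) / len(x)
    return math.sqrt(sum((v - m) ** 2 for v in x) / (len(x) - 1))


def std_one_pass(x):
    """Software under test B: the 'textbook' formula, prone to cancellation."""
    n = len(x)
    s = sum(x)
    ss = sum(v * v for v in x)
    var = (ss - s * s / n) / (n - 1)
    return math.sqrt(max(var, 0.0))  # clamp: rounding can make var negative


def digits_of_agreement(test, ref):
    """Assumed score: decimal digits shared with the reference, in [0, 16]."""
    rel = abs(test - ref) / abs(ref)
    if rel == 0.0:
        return 16.0
    return max(0.0, min(16.0, -math.log10(rel)))


if __name__ == "__main__":
    # Increasing the offset makes the data set harder for implementation B.
    for offset in (0, 10**3, 10**6, 10**9):
        data, _, ref_std = reference_data(offset)
        a = digits_of_agreement(std_two_pass(data), ref_std)
        b = digits_of_agreement(std_one_pass(data), ref_std)
        print(f"offset={offset:>10}: two-pass {a:4.1f} digits, one-pass {b:4.1f} digits")
```

The offset acts as a difficulty parameter: the spread of the data stays fixed while the mean grows, so an implementation that loses accuracy through cancellation is exposed as the offset increases, while a stable one keeps agreeing with the reference result.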
Similar resources
Design, Construction and Evaluation of an Anthropomorphic Head Phantom for Assessment of Image Distortion in Stereotactic Radiosurgery Planning Systems
Introduction: In recent years, the use of magnetic resonance (MR) images in radiation treatment planning has drawn considerable attention. However, although the extent of a tumor can be determined in great detail on MR images, the geometric accuracy of these images is limited by distortions stemming from the inhomogeneity of the static background magnetic field, the nonlineari...
THE PREPARATION AND EVALUATION OF REFERENCE LEISHMANIN FROM LEISHMANIA MAJOR FOR USE IN MAN FOR DIAGNOSTIC AND EXPERIMENTAL PURPOSES
Cell-mediated immunity (CMI) plays an important role in resistance against leishmaniasis. Delayed-type hypersensitivity (DTH) reaction measured by skin testing is a practical method of evaluation of CMI and is used as an aid to diagnosis and for epidemiological assessment of exposure to leishmanial infection. Skin testing in leishmaniasis, generally known as Montenegro or leishmanin test, r...
Setting Up Analytical Methods for Evaluating the Characteristics of Recombinant Hepatitis B Vaccine
Background: Hepatitis B vaccination has been included in routine immunization of all individuals according to WHO recommendations since 1991. Despite successful coverage, 3-5% of recipients fail to mount a desirable protection level of Ab. Vaccine failure can result from the emergence of mutations, immune failure of individuals, a decrease in vaccine potency, and other causes. The quality of Hepatitis B va...
Improvement of effort estimation accuracy in software projects using a feature selection approach
In recent years, utilization of feature selection techniques has become an essential requirement for processing and model construction in different scientific areas. In the field of software project effort estimation, the need to apply dimensionality reduction and feature selection methods has become an inevitable demand. The high volumes of data, costs, and time necessary for gathering data, ...
Improving Architectural Design Skills with Design-Based Learning of New Structures
The purposeful and applied learning of Structures as a pillar of architectural design is very important. The current educational content of Structures in architecture departments is based on theoretical discussions, mathematical formulas, and lecture-oriented material. As a result, students are incompetent in applying practical concepts and structural formal analyses to architectural design. Ef...
Publication date: 1998